Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers

نویسندگان

Santiago Omar Caballero Morales

Stephen J. Cox

چکیده

Dysarthria is a motor speech disorder characterized by weakness, paralysis, or poor coordination of the muscles responsible for speech. Although automatic speech recognition (ASR) systems have been developed for disordered speech, factors such as low intelligibility and limited phonemic repertoire decrease speech recognition accuracy, making conventional speaker adaptation algorithms perform poorly on dysarthric speakers. In this work, rather than adapting the acoustic models, we model the errors made by the speaker and attempt to correct them. For this task, two techniques have been developed: (1) a set of “metamodels” that incorporate a model of the speaker’s phonetic confusion matrix into the ASR process; (2) a cascade of weighted finite-state transducers at the confusion matrix, word, and language levels. Both techniques attempt to correct the errors made at the phonetic level and make use of a language model to find the best estimate of the correct word sequence. Our experiments show that both techniques outperform standard adaptation techniques.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximum Likelihood Linear Regression (MLLR) for ASR Severity Based Adaptation to Help Dysarthric Speakers

Automatic speech recognition (ASR) for dysarthric speakers is one of the most challenging research areas. The lack of corpus for dysarthric speakers makes it even more difficult. The speaker adaptation (SA) is an alternative solution to overcome the lack of dysarthric speech and enhance the performance of ASR. This paper introduces the Severity-based adaptation, using small amount of speech dat...

متن کامل

Vocal tract representation in the recognition of cerebral palsied speech.

PURPOSE In this study, the authors explored articulatory information as a means of improving the recognition of dysarthric speech by machine. METHOD Data were derived chiefly from the TORGO database of dysarthric articulation (Rudzicz, Namasivayam, & Wolff, 2011) in which motions of various points in the vocal tract are measured during speech. In the 1st experiment, the authors provided a bas...

متن کامل

Automatic recognition of dutch dysarthric speech: a pilot study

This paper describes a feasibility study into automatic recognition of Dutch dysarthric speech. Recognition experiments with speaker independent and speaker dependent models are compared, for tasks with different perplexities. The results show that speaker dependent speech recognition for dysarthric speakers is very well possible, even for higher perplexity tasks.

متن کامل

Running Head: DIFFICULTIES IN AUTOMATIC SPEECH RECOGNITION DIFFICULTIES IN AUTOMATIC SPEECH RECOGNITION OF DYSARTHRIC SPEAKERS AND THE IMPLICATIONS FOR SPEECH-BASED APPLICATIONS USED BY THE ELDERLY: A LITERATURE REVIEW

Automatic speech recognition is being used in a variety of assistive contexts, including home computer systems, mobile telephones, and various public and private telephony services. Despite their growing presence, commercial speech recognition technologies are still not easily employed by individuals who have speech or communication disorders. While speech disorders in older adults are common, ...

متن کامل

Estimation of Phoneme-Specific HMM Topologies for the Automatic Recognition of Dysarthric Speech

Dysarthria is a frequently occurring motor speech disorder which can be caused by neurological trauma, cerebral palsy, or degenerative neurological diseases. Because dysarthria affects phonation, articulation, and prosody, spoken communication of dysarthric speakers gets seriously restricted, affecting their quality of life and confidence. Assistive technology has led to the development of spee...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

EURASIP J. Adv. Sig. Proc.

دوره 2009 شماره

صفحات -

تاریخ انتشار 2009

Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers

نویسندگان

چکیده

منابع مشابه

Maximum Likelihood Linear Regression (MLLR) for ASR Severity Based Adaptation to Help Dysarthric Speakers

Vocal tract representation in the recognition of cerebral palsied speech.

Automatic recognition of dutch dysarthric speech: a pilot study

Running Head: DIFFICULTIES IN AUTOMATIC SPEECH RECOGNITION DIFFICULTIES IN AUTOMATIC SPEECH RECOGNITION OF DYSARTHRIC SPEAKERS AND THE IMPLICATIONS FOR SPEECH-BASED APPLICATIONS USED BY THE ELDERLY: A LITERATURE REVIEW

Estimation of Phoneme-Specific HMM Topologies for the Automatic Recognition of Dysarthric Speech

عنوان ژورنال:

اشتراک گذاری